Performance of the IBM large vocabulary continuous speech recognition system on the ARPA Wall Street Journal task

Authors

  • Lalit R. Bahl
  • S. Balakrishnan-Aiyer
  • Jerome R. Bellegarda
  • Martin Franz
  • Ponani S. Gopalakrishnan
  • David Nahamoo
  • Miroslav Novak
  • Mukund Padmanabhan
  • Michael Picheny
  • Salim Roukos
Abstract

In this paper we discuss various experimental results using our continuous speech recognition system on the Wall Street Journal task. Experiments with different feature extraction methods, varying amounts and types of training data, and different vocabulary sizes are reported.

Similar resources

The RWTH large vocabulary continuous speech recognition system

In this paper, we present an overview of the RWTH Aachen large vocabulary continuous speech recognizer. The recognizer is based on continuous density hidden Markov models and a time-synchronous left-to-right beam search strategy. Experimental results on the ARPA Wall Street Journal (WSJ) corpus verify the effects of several system components, namely linear discriminant analysis, vocal tract nor...

The RWTH Speech Recognition System and Spoken Document Retrieval

In this paper, we present an overview of the RWTH Aachen large vocabulary continuous speech recognizer. The recognizer is based on continuous density hidden Markov models and a time-synchronous left-to-right beam search strategy. Experimental results on the ARPA Wall Street Journal (WSJ) corpus verify the effects of several system components, namely linear discriminant analysis, vocal tract nor...

Transcribing broadcast news shows

While significant improvements have been made over the last 5 years in large vocabulary continuous speech recognition of large read-speech corpora such as the ARPA Wall Street Journal-based CSR corpus (WSJ) for American English and the BREF corpus for French, these tasks remain relatively artificial. In this paper we report on our development work in moving from laboratory read speech data to r...

Issues in Large Vocabulary, Multilingual Speech Recognition

In this paper we report on our activities in multilingual, speaker-independent, large vocabulary continuous speech recognition. The multilingual aspect of this work is of particular importance in Europe, where each country has its own national language. Our existing recognizer for American English and French has been ported to British English and German. It has been assessed in the context of ...

On designing pronunciation lexicons for large vocabulary, continuous speech recognition

Creation of pronunciation lexicons for speech recognition is widely acknowledged to be an important, but labor-intensive, aspect of system development. Lexicons are often manually created and make use of knowledge and expertise that is difficult to codify. In this paper we describe our American English lexicon developed primarily for the ARPA WSJ/NAB tasks. The lexicon is phonemically represent...

Publication date: 1995